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A portion of the disclosure of this patent document contains command formats 
and other computer language listings, all of which are subject to copyright protection. 
The copyright owner, EMC Corporation, has no objection to the facsimile reproduction 
by anyone of the patent document or the patent disclosure, as it appears in the Patent and 
5 Trademark Office patent file or records, but otherwise reserves all copyright rights 
whatsoever. 

Field of the Invention 

The invention relates generally to management of software program code 
10 creation, and more particularly to a system and method for managing the risk that code 
created may later be interpreted erroneously. 

Background of the Invention 

15 Computer systems employ microprocessors that interpret software or program 

code instructions extremely fast. Rapid decisions and results follow in split-second 
chain reaction fashion. Sometimes the program code being interpreted may be read 
incorrectly, due to electronic noise for example, and in such a case an error may be 
flagged which slows down the computers operation. On the other hand the code may be 

20 interpreted to mean something other than that which it really means and an erroneous 
action results, which in turn may lead to other such erroneous actions. In any case such 
misinterpreted code will typically lead to expensive delays and costs associated with 
service calls and possibly repairs. 
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Sometimes service calls are called due to erroneously flagged errors. At the time 

of service, it may not be apparent that the service call is due to an error. The error 

message may then dictate that a component such as an electronics board or hard disk 

drive be replaced. In such a case the removed component may be later tested to 

5 determine its cause of failure. But the testing would probably reveal no trouble found. 
In this case the component would be destroyed or scrapped as its quality reputation has 
been severely damaged even when it in fact is fine. In this case, there would have been 
costly delays to normal production because of the flagged errors, a costly service call, an 
expensive repair and removal of a component that was probably never flawed in the first 

10 place. Moreover, the original problem(s) probably are still present in the system and are 
ready to repeat. 

The expensive problems described above may be even more costly in a modern 
system used for data storage and retrieval. As is known in the art, such computer systems 
generally include a central processing unit (CPU), a memory subsystem, and a data 

15 storage subsystem. According to a network or enterprise model of the computer system, 
the data storage system associated with or in addition to a local computer system, may 
include a large number of independent storage devices or disks housed in a single 
enclosure or cabinet. This array of storage devices is typically connected to several 
computers over a network or via dedicated cabling. Such a model allows for the 

20 centralization of data that is to be shared among many users and also allows for a single 
point of maintenance for the storage functions associated with the many host processors. 
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The data storage system stores critical information for an enterprise that must be 
available for use substantially all of the time. If an error occurs on such a data storage 
system it must be fixed as soon as possible because such information is at the heart of the 
commercial operations of many major businesses. A recent economic survey from the 
5 University of Minnesota and known as Bush-Kugel study indicates a pattern that after just 
a few days (2 to 6) without access to their critical data many businesses are devastated. 
The survey showed that 25% of such businesses were immediately bankrupt after such a 
critical interruption and less than 7% remained in the marketplace after 5 years. 

As has been described above, it is important to focus error recovery procedures on 
10 systems that actually require it and extremely wasteful to focus such efforts on systems 
not actually in need of repair. The same that is true for simple computer systems is true 
for data storage systems but magnified in importance and scale of potential costs. 

Accordingly there is a need in the general computer arts and the more specific 
data storage and retrieval arts to manage error handling so that false errors are less likely 
15 to occur and so that program code is more likely to be interpreted as it is written instead 
of as another code message that may be similar. 

Summary of the Invention 

20 The present invention is a system and method for managing a risk that 

interpretation of program code may be faulty. The method includes the steps of 
establishing a criterion denoted as a code difference criterion, and applying the code 
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difference criterion to arrive at a set of predetermined size of members that are acceptable 

codes that satisfy the difference criterion. 

This invention allows management of the risk that errors in interpreting program 

code may lead to costly problems. A code difference criterion related to code value and 
5 position is used to lower such risk. Satisfying the criterion ensures that each of the 

possible program codes will be sufficiently different from all other possible program 

codes such that non-catastrophic system problem will not render them identical. 

One embodiment of this invention includes a method for managing software 

program code for risk related to errors associated with interpretation of such program 
10 code by determining acceptable program codes to reduce such errors. This method 

involves establishing a criterion denoted as a code difference criterion for determining 

one or more acceptable program codes. A candidate program code and a set of other 

candidate program codes having a predetermined number of member candidate program 

codes are selected. Under program control the system determines whether the candidate 
15 program code is an acceptable program code based on a comparison of the candidate 

program code to each member of the set of other candidate program codes in accordance 

with whether the code difference criterion is satisfied. 

In general a difference criterion of a predetermined number of bits difference of 

code must be satisfied. For example, if the predetermined number is 2 then satisfying 
20 the difference criterion requires that every code will have at least 2 bits different from 

any other code. 
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In an alternative embodiment, the invention includes a system for carrying out 

method steps. In another alterative embodiment, the invention includes a program 

product for carrying out method steps. 



5 Brief Description of the Drawing s 

The above and further advantages of the present invention may be better under 
stood by referring to the following description taken into conjunction with the 
accompanying drawings in which: 
10 Fig. 1 is a block diagram of the preferred architecture of the invention and 

including a schematic of the logic of this invention; 

Fig.2 is an exemplary representation of a computer-readable medium encoded 
with the logic of Fig. 1 for enabling the method of the present invention; 
Fig. 3 is an overview of the method steps of this invention; 
15 Fig. 4 is a flow logic diagram illustrating some of the method steps of Fig. 3 that 

are carried out by the logic of this invention of Fig. 1 ; 

Fig. 5 is another flow logic diagram illustrating other method steps also carried 
out by the logic of Fig. 1; and 

Fig. 6 is another flow logic diagram illustration other method steps also carried 
20 out by the logic of Fig. 1 . 



Detailed Description of the Preferred Embodiment 
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The methods and apparatus of the present invention are intended for use in 

general purpose digital computers that become specific purpose digital computers under 

the construct of the inventive logic of this invention. The methods enabled by such logic 

may be useful with data storage systems, such as the Symmetrix Integrated Cache Disk 

5 Array system available from EMC Corporation of Hopkinton, MA. 

The methods and apparatus of this invention may take the form, at least partially, 
of program code (i.e., instructions) embodied in tangible media, such as floppy diskettes, 
CD-ROMs, hard drives, random access or read only-memory, or any other machine- 
readable storage medium. When the program code is loaded into and executed by a 

10 machine, such as a computer, the machine becomes an apparatus for practicing the 

invention. The methods and apparatus of the present invention may also be embodied in 
the form of program code that is transmitted over some transmission medium, such as 
over electrical wiring or cabling, through fiber optics, or via any other form of 
transmission. And may be implemented such that herein, when the program code is 

15 received and loaded into and executed by a machine, such as a computer, the machine 
becomes an apparatus for practicing the invention. When implemented on a general- 
purpose processor, the program code combines with the processor to provide a unique 
apparatus that operates analogously to specific logic circuits. 

The logic for carrying out the method is embodied as part of the system described 

20 below beginning with reference to Fig. 1 . One aspect of the invention is embodied as a 
method that is described below with detailed specificity in reference to Figs. 3-6. 
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Fig. 1 shows an overall system 100 employing the Logic 106 that carries out the 
method steps of the invention. A computer system 102, which may be a general-purpose 
digital computer transformed into a special purpose computer by implementation of 

5 Logic 106 residing in memory element 104, which may be conventional electronic 

memory, e.g. RAM. A processor 108, such as a well-known Intel microprocessor carries 
instructions in a conventional manner and allows the Logic 106 to operate in the 
computer system, perhaps under control of an operating system (not shown). 

Reference is made now to Figs. 1 and 2. Logic 106 is preferably implemented as 

10 software, and may be in the well known C or C ++ programming language. Of course, 
the Logic could be implemented in hardware as well. Computer System 102 may be in 
communication with an input device 112 (e.g. a mouse or keyboard) or output device 110 
(e.g. a display or printer). The results of running Logic 106 may be useful on a target 
system, e.g. a Data Storage System, such as the EMC Symmetrix Integrated Cache Disk 

1 5 Array available from EMC of Hopkinton, MA. Logic 1 06 may be implemented on a 
computer-readable medium, e.g. a CD ROM or Floppy Disk as shown in Fig. 2. 

Fig. 3 shows an overview of the method steps carried out by Logic 106 in system 
100 (Fig. 1). The method of this invention employs a method using a criterion that must 
be satisfied before a code may be used. The criterion is denoted as a Difference 

20 Criterion. Employing the Difference Criterion generally reduces the risk of causing the 
types of problems described below (example shown in Table 1). The Difference 
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Criterion is preferably structured so that in order to satisfy it every code must have at 

least 2 bits that are different from any other code that may also be used. 

Step 200 allows a user to establish a code Difference Criterion. In general a 

difference criterion of a predetermined number of bits difference of code must be 
5 satisfied. For example, if the predetermined number is 2 then satisfying the difference 

criterion requires that every code will have at least 2 bits different from any other code. 

In such a case, the Difference Criterion may be short handedly referred to as requiring a 

"minimum 2 bits difference." 

Referring again to Fig. 3, a user would then need to determine the number (#) of 
10 distinct codes needed (DCN), i.e. the number of codes that must have different values. 

The choice for DCN typically depends on a plethora of circumstances leading to a design 

choice. For example, a system may need 100 different error messages, and 400 other 

different types of messages, in which case the value of DCN would be 500. The values 

of each candidate code would then have to be compared to determine if the Difference 
15 Criterion was satisfied (Step 204). This would be continued until enough codes were 

found that satisfied DCN. 

Without employing the present invention, prior art schemes act as follows. The 

number of bits (lines) x, needed for the generation and transmission of codes is based on 

the solution of the equation: 
20 2 X = N {Equation 1), where N is the number of distinct codes required. 
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The number of distinct codes needed is typically a predetermined design choice based on 
how many situations needed to be handled. For example a distinct code might be needed 
for each error message, for each instruction to be handled, and so forth. 

For example, if N was determined to be 6, then Equation 1 will yield a value of x 
5 of 3. The 6 codes may be selected from the possible 8 shown below in Table 1 : 
Table 1 : Example of acceptable codes not using Difference Criterion 



Decimal Value 


Binary Code Value 


0 


000 


1 


001 


2 


010 


3 


011 


4 


100 


5 


101 


6 


110 


7 


111 



As can be seen from this example, some of the 6 codes chosen will only be one 
1 0 distinct bit different from some of the other codes. But the inventor has critically 
recognized that this can lead to a disadvantage that is solved by this invention. The 
disadvantage is that any type of intermittent electronic noise operating on the signal line 
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carrying the code can change its binary value, by for example changing a 1 to a 0 or vise- 
versa. 

Illustrating further this example, if the decimal value of 2, which has the binary 
value of 01 0 is changed by a noise glitch to 01 1 or 1 10 then it will be erroneously 

5 interpreted as either decimal 3 or decimal 6, respectively. Such an interpretation might 
lead to flagging an error when the underlying condition that would lead to such an error is 
not present, or otherwise flagging an error incorrectly. This would lead to the types of 
costly problems described above. 

The method of this invention employs a method using a criterion that must be 

10 satisfied before a code may be used. The criterion is denoted as a Difference Criterion. 
Employing the Difference Criterion generally reduces the risk of causing the types of 
problems described above with reference to the example of Table 1. The Difference 
Criterion is preferably structured so that in order to satisfy it every code must have at 
least 2 bits that are different from any other code that may also be used. 

15 The Logic 106 of this invention employs the Difference Criterion to any candidate 

line of code to determine its suitability in a relatively automated fashion. Employing such 
a Difference Criterion for a requirement of 7 distinct codes would yield the following 
dispositions as shown in Table 2. 



11 



Patent Application 
Docket Number: EMC-01-187 



Applicant: Rubinstein 
EMC CONFIDENTIAL 



Table 2: Example of a requirement for 7 distinct codes using Difference Criterion 



Decimal Value 


Binary Code Value 


Disposition 


1 


00001 


TRUE 


2 


00010 


TRUE 


3 


00011 


FALSE 


4 


00100 


TRUE 


5 


00101 


FALSE 


6 


00110 


FALSE 


7 


00111 


TRUE 


11 


01011 


TRUE 


16 


10000 


TRUE 


22 


10110 


TRUE 



5 TRUE means that the Difference Criterion is correctly satisfied. 
FALSE means that the Difference Criterion is not correctly satisfied. 

So in the example of Table 2, the following decimal values are acceptable: 
1,2,4,7, 11,16, and 22, wherein the values of 3, 5, and 6 are rejected as being non- 
10 acceptable because they do not satisfy the Difference Criterion. Each of the acceptable 
codes satisfies the minimum 2-bits difference required (for the exemplary case). 

12 



Patent Application 
Docket Number: EMC-01-187 
Applicant: Rubinstein 
EMC CONFIDENTIAL 

In general, one embodiment for carrying out the method of Fig. 3 includes a steps 
for managing software program code for risk related to errors associated with 
interpretation of such program code by determining acceptable program codes to reduce 
such errors. This method involves establishing a criterion denoted as a code difference 

5 criterion for determining one or more acceptable program codes. A candidate program 
code and a set of other candidate program codes having a predetermined number of 
member candidate program codes are selected. Under program control the system 
determines whether the candidate program code is an acceptable program code based on 
a comparison of the candidate program code to each member of the set of other candidate 

10 program codes in accordance with whether the code difference criterion is satisfied. Such 
an embodiment is described in detail with reference to Figs. 4-6. 

This detailed description of the method steps now follows with reference to flow- 
logic diagrams in Figs. 4 and 6. Using these diagrams, one skilled in the art may create a 
suitable computer program for carrying out computer-executed steps. Such a program 

15 may be written in any computer-executable language, but C or C++ is a good choice. 

Referring to Fig. 4, Step 208 allows a user to use input device 1 12 to input a value 
for DCN, and the Logic then sets the DCN counter to 1 . Next a Current Code value is set 
equal to decimal value 1 (binary 00001). Although the illustrations for the preferred 
embodiment are in binary, one skilled in the art will recognize the invention to be 

20 applicable to any numerically based system, such as a hexadecimal or another numerical 
system. In step 212, the Current Code is then compared to the Next Candidate Codes 
(Next Codes), which can be arrived at in a number of ways. A good choice is to take the 
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DCN times the number of different bits sought to arrive at some group of Next Codes that 
is likely to produce a sufficient set of acceptable Codes (equal to at least the DCN value). 
In the event that DCN times the number of different bits sought does not yield a large 
enough set, one skilled in the art will recognize that the set can be made larger by 

5 choosing a larger multiplier, such as the number of different bits sought+1 . Step 214 is a 
connecting step "A" that flows into the logic diagrams shown in Fig. 5. 

Referring to Fig. 5, step 218 increments the Current Code by 1 . A determination 
is made in step 220 to see if the Difference Criterion is satisfied for the Current Code (as 
compared to each of the Next Codes produced by creating a set of the size DCN times the 

10 number of different bits sought, wherein each member of the set is a code having a 

decimal value from 1 to DCN times the number of different bits sought. Naturally there 
will be an overlap in identity with the Current Code itself but it is understood that this 
case can be excluded from consideration when determining if the Current Code satisfies 
the Difference Criterion. Returning to the method steps, if the Difference Criterion is not 

15 satisfied for every comparison with each Next Code (other than one that is identical) then 
the answer is "No." In this case the candidate is rejected in step 222. Processing flows 
again to logic step 216 (connecting step B), which in turn returns processing back to the 
diagram illustrated in Fig 4, and flows into step 212. If on the other hand, the Difference 
Criterion is satisfied then processing flows to step 224 and connecting step C (flows to 

20 Fig. 6). 

Only those codes satisfying the Difference criterion are stored for use, shown in 
step 226 of Fig. 6. In step 228 the value of the DCN counter, depicting the number of 
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acceptable codes found, is compared to the desired DCN. If the amounts are equal then 

processing stops in step 234 and the acceptable codes are outputted to display or printed 

or otherwise passed, such as to Target System 114. If the amounts are not equal then the 

Current Code is incremented by 1 in step 230 and a new search loop is initiated at step 

5 2 12. If the entire block of Next Codes was used as Current Code and the DCN was not 
satisfied, the definition of the Next Codes block counter is incremented by 1 in step 245. 
All counters are cleared and the entire process returns to the diagram illustrated in Fig. 4 
and in turn step 210. 

Processing continues so that each code is compared to each other code so the 

10 value of each one is at least 2 bits in difference from each other and the DCN counter is 
equal to the value of the DCN. When this occurs then processing stops in step 234 and 
the acceptable codes are outputted to display or printed or otherwise passed, such as to 
Target System 114. 

Another way of stating the requirement of the preferred implementation of the 
15 Difference Criterion is that it requires that the acceptable program code have a binary 
equivalent that when compared to a binary equivalent of each member of the set of other 
candidate program codes has a minimum 2-bits difference from each member. (See for 
example Table 2). 

In general, a plurality of acceptable program codes are determined based on a 
20 plurality of comparisons of a plurality of candidate program codes to a plurality of sets of 
other candidate program codes. Each candidate program code of the plurality of 
candidate program codes is selected from a universal group consisting of the first 
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candidate program code and the first set of other candidate program codes. Also each 

member of each set of other candidate program codes of the plurality of other candidate 

program codes is also selected from this universal group. 

One skilled in the art will realize that there are many ways to accomplish the type 

5 of repetitive yet iterative looping, comparing, and adjusting that must be performed to 
carry out the method steps. Although not limited to such an embodiment, the inventor 
has recognized that a use of array indexed loops, sometimes referred to as "For next i," 
and "For next j," loops used in a language such a C or C++ are satisfactory. For 
comparing the values of each bit itself it may be useful to employ shift registers. One 

10 may also choose to not subject any Code found satisfying the Difference Criterion, but 
then one must store this value and account for it in subsequent comparisons. Depending 
on available resources and lines of code one is willing to write, one may choose to allow 
each subsequent comparison to have some repetition and include such a Code in 
subsequent comparisons. Nevertheless, this aspect is a design choice, and one skilled in 

15 the art will recognize that this invention is not limited to any such implementation. 

A system and method has been described for managing the risk of errors due to 
faulty interpretation of program code. Having described a preferred embodiment of the 
present invention, it may occur to skilled artisans to incorporate these concepts into other 
embodiments. Nevertheless, this invention should not be limited to the disclosed 

20 embodiment, but rather only by the spirit and scope of the following claims and their 
equivalents. 
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